Counting elements and geodesies 
in Thompson's group F 

Murray Elder* 

School of Mathematics and Physics, University of Queensland, Brisbane, Australia 

Eric Fusy 

LIX, Ecole Polytechnique, Paris, France 

Andrew Rechnitzer 

Department of Mathematics, University of British Columbia, Vancouver, Canada 



Abstract 

We present two quite different algorithms to compute the number of elements in the sphere 
of radius n of Thompson's group F with standard generating set. The first of these requires 
exponential time and polynomial space, but additionally computes the number of geodesies 
and is generalisable to many other groups. 

The second algorithm requires polynomial time and space and allows us to compute the 
size of the spheres of radius n with n < 1500. Using the resulting series data we find that 
the growth rate of the group is bounded above by 2.62167. . .. This is very close to Cuba's 
lower bound of 16|. Indeed, numerical analysis of the series data strongly suggests 

that the growth rate of the group is exactly '^^^ ■ 
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1. Introduction 

Let G be a group with finite generating set X. Recall that / : N — ?■ N is the (spherical) 
growth function for {G,X) if f{n) is the number of elements in the sphere of radius n 
of the corresponding Cayley graph. Define the (spherical) geodesic growth function to be 
(7 : N — > N where g{n) is the number of all geodesies of length n in the Cayley graph [14]. 
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In this article we give two quite different algorithms to compute the growth function of 
Thompson's group F . The first, Algorithm A, applies to a range of groups, and computes 
both /(n) and g{n). It runs in exponential time and polynomial space, and is implemented 
to compute the first 23 terms of both functions with moderate computer resources. This 
algorithm is based on a random sampling algorithm developed by the third author and van 



Rensburg for problems in lattice statistical mechanics [21 



The second algorithm. Algorithm B, is specific to Thompson's group F, and computes 
f{n) in polynomial time and space. It is based on the forest diagram representation of 
elements of F, introduced by Belk & Brown [ij, and the associated length formula, which 
itself is based on work of Fordham [l^. We implemented this algorithm and were able to 
compute the first 1500 terms of /(n), again with moderate computer resources. This data 
enables us to obtain an upper bound of 2.62167 ... on the growth rate of F, which differs by 
only 0.15% from the lower bound of obtained by Guba 16]. Based on this and other 
numeric evidence, we conjecture that Cuba's bound is indeed the correct growth rate. Both 
sets of data (the growth up to 1500 and geodesic growth up to 22)are published on the 
Online Encyclopedia of Integer Sequences as A156945 and A156946 23 . 

Note that when the spherical (geodesic) growth is exponential, the growth rate coin- 
cides with the usual growth and geodesic growth rates which count the number of elements 
(geodesies) of length at most n rather than exactly n. 

Many researchers have made an attempt to compute the growth of Thompson's group. 
Guba [l6|] used a brute force technique to compute the first 9 terms of the growth series. 
This was extended to the first 13 by Burillo, Cleary and Weist Matucci considered a 



system of recurrences obtained by considering forest diagrams in his thesis [20[, which turned 
out to be quite complicated. It is possible that his approach could be used to compute the 
growth series but this does not appear to have been pursued. Our own approach is to set 
up an algorithm (being Algorithm B) that enumerates the number of elements in the sphere 
of radius n by computing the number of forest diagrams of weight n. Using this we have 
been able to compute a large number of terms. One key ingredient in our algorithm is the 
encoding of forest diagrams by labelling both the gaps between leaves (as Belk & Brown do) 
as wel l as the internal nodes of trees, which is based on an encoding of binary trees described 
in 
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Unfortunately, we have not been able to leverage our algorithm into a closed form ex- 
pression for the growth function or the corresponding generating function. However, an 
analysis of our data indicates that the series does not correspond to a simpl^ rational, al- 
gebraic or differentiably-finite generating function (satisfying a linear ordinary differential 
equation with polynomial coefficients — see ji^). It is entirely possible that the generating 
function lies outside these classes of functions and that no reasonable closed form solution 
exists. See ji], 0] for examples of problems with similar (but far simpler) recurrences that 
have surprisingly complicated solutions. 

The paper is organised as follows. In Section |2] we describe the first algorithm, which 



ie satisfying an equation of low order which has coefficients of low degree. 
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computes in exponential time the coefficients of tlie growtli and geodesic growtli functions of 
any group with an efficient solution to a certain geodesic problem. We apply this to Thomp- 
son's group F and display our results. In Section [3] we describe the second, polynomial time 
algorithm which we have used to compute the first 1500 terms of the growth function of F. 
We describe the forest diagram construction, how it leads to our enumeration algorithm, 
and give the results. We then prove an upper bound for the growth rate. In Section H] 
we summarise our findings, and in Appendix |X] we give more detailed pseudocode for our 
algorithms. 

We work with the presentation 

for Thompson's group F. We refer the reader to ^ and Q for an introduction to Thompson's 
group. 

The authors thank Jose Burillo, Sean Cleary, Martin Kassabov, Manuel Kauers, Francesco 
Mattucci, Buks van Rensburg and Jennifer Taback for fruitful discussions and ideas, and the 
anonymous reviewer for their careful reading and very helpful feedback. 



2. Algorithm A 

We begin with the following lemma, due to the third author and van Rensburg in [2l|, 
which arises in the context of randomly s amp ling self-avoiding walks and polygons using a 
generalisation of the Rosenbluth method [l8|, |22|] . In particular, it allows us to enumerate 
objects without having to use a unique construction. 

Definition 1. Let G be a group with finite generating set X. Given a word w in the 
generators of the group define 

d_{w) = {xeX \ £{w) > £{wx)} 
d+{w) = {x G X I l{w) < l{wx)} 
do{w) = {xeX \ i{w) = i{wx)}, 

where i{w) is the geodesic length of the element represented by w. These are the subsets 
of the generators that shorten, lengthen and do no change the geodesic length. Note that if 
the group only has relations of even length (as is the case for F) then do{w) = 0. 

Lemma 2 (From [2l[). Let r„ be the set of all geodesic words of length n. Given a word w 
from this set, let Wi be its prefix of length i. The size of the sphere of radius n (ie the number 
of distinct elements whose geodesic length is n) is given by 
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Proof. We prove this result by induction on n. When n = 1, the set of geodesies is just the 
set of generators and so expression is true. 

Let g be an element of S{n) and let Tn{g) be the set of geodesic paths from the identity 
to g. It then suffices to show that 

Every geodesic ending at g can be written as the product of a geodesic in r„_i and a 
generator. Thus we can write 

1 """^ 1 

The inner sum is zero when the set Tn-iigx^^) is empty. The induction hypothesis implies 
that the inner sum is equal to 1 when the set is non-empty (ie when £{gx~^) = n — 1). 
Exactly of the sets are non-empty and so the result follows. □ 

2.1. Description of the algorithm 

The above lemma allows us to compute the size of S{n) much more efficiently than 
brute-force methods. The time complexity is proportional to the size of r„ and the time 
to compute d±{w). The memory required is significantly reduced; we only require space to 
compute d±{w) and the current geodesic word. This is substantially better than brute force 
methods; we do not have to store normal forms for all the elements in S{0), . . . S{n). 

Suppose that G has an inverse closed generating X set of size m, and that there is an 
algorithm to compute d-{u), d+{u) for any geodesic u in this generating set. Fix an order on 
the generators of X (this also fixes a lexicographic order on the set of all geodesic words). 
Then the following algorithm computes the number of elements of length n. 



Algorithm A. Outputs the number of elements in the nth sphere. 
Set a counter to 0. 

Starting from the empty word, run through all geodesies of length n. 

Given the current geodesic, m, compute \d_{ui)\ for each length i prefix of m, and add 

nr 1 1. / M to the counter. 

By Lemma [2] the final value of the counter is the number of elements in the sphere of radius n. 
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There are a number of ways of running through the hst of all geodesies; perhaps the 
simplest is to traverse all the geodesies in lexicographic order using a recursive depth-first 
algorithm (see the pseudocode for ComputeGeodO and ComputeSphereO in Appendix lA.ip . 
It can also be done using a slightly more involved iterative procedure that given a geodesic 
finds the next geodesic in the lexicographic ordering (see the pseudocode for NextGeodO 
also in Appendix IA.1|) . 

Note that the above algorithm is easily modified to obtain the number of geodesies of 
length n: rather than adding nr=i \d counter each time, simply add 1. 

Proposition 3. The time complexity of Algorithm A is 0(|r„|), multiplied by the time to 
compute d+ at each step. The space complexity is linear plus the space required to compute 
d-,d+. 

Proof. As we run through all geodesies u, we only need space to store the current geodesic 
word, which requires linear space, plus the space required to compute d_,d^. Since we must 
consider all geodesies, the time is proportional to the number of geodesies in |r„|, multiplied 
by the time to compute d_, d+ at each step. □ 

So the time and space complexity of the algorithm depends on the complexity of com- 
puting d+ and d_. This naturally leads to the geodesic problems described in the next 
section. 

2.2. Geodesic problems 

Algorithm A gives a memory-efficient procedure to compute growth and geodesic growth 
for any group that has an efficient solution to the following problem: 

Problem 1. Given a geodesic u and generator x, determine if x G d^{u),dQ{u) or d_{u). 

Note that if all the relators in R have even length then these are the only two possibilities, 
since if x G do{u) means there must be an odd length relator. So the problem turns into the 
decision problem: 

Problem 2. Given a geodesic u and generator x, decide if x G (i+(M). 

If {G,X) has a polynomial time and space algorithm to answer Geodesic Problem 1, 
then Algorithm A computes both f[n) = \S{n)\ and g{n) = |r(?T,)| in exponential time, 
but polynomial space (assuming g grows exponentially). In [oj the authors consider these 
problems in more detail, and prove that they imply a solvable word problem for groups whose 
set of relators is countably enumerable. Note that if a group has an efficient solution to its 
word problem, then a naive brute force computation of /(n) and g{n) would still require 
an exponential amount of memory (assuming the functions / and g are exponential), so a 
polynomial space algorithm is a marked improvement on this. 
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2.3. Thompson's group F 

For Thompson's group F, one can use the geodesic length formula of Fordham 13| to 
compute the length of any word in polynomial time and space. There are now other similar 
formulations, and we note those due to Guba [l^ and Belk & Brown [l|. We chose to base 
both of our algorithms on the forest diagrams of Belk & Brown as we found them the easiest 
to implement. We note that there is a simple bijection between this representation and 
binary tree pairs. It is possible to develop similar algorithms based on the Fordham and 
Guba formulations, but we have not pursued these possibilities because we believe that they 
will give very similar results. 

We start our description of the algorithm by first explaining the forest-diagram length 
formula of Belk & Brown (we refer the reader to the original paper [li] for a fuller explanation). 
In Figure [1] we describe some of the terminology associated with this formula. 




.A 



A A A /A V V 



Figure 1: (left) A caret is a inner node of a binary tree together with the two out-going edges, (middle) A 
binary tree consisting of 6 carets and so 6 inner nodes and 7 leaves, (right) A pair of forests with a pair of 
common carets highlighted; such a diagram is not admissible. 



Consider the forest diagram in Figure |2] (which is Example 4.2.8 from It consists of 

.jC\A.k A 

LINILNILLIRXNI 
I ILLLILRRRRXI I 

V ^' V 

Figure 2: A forest diagram. The gaps are labelled according to a simple set of rules. The length of the 
corresponding element of Thompson's group is computed by considering the label-pairs in each column. In 
this case the element has length [2, 2, 2, 2, 2, 4, 2, 1, 1, 1, 2, 2, 4, 2] = 29. 

two rows of forests of binary trees, above and below two rows of labelled gaps. One (possibly 
empty) tree in each forest is pointed and the diagram cannot contain "common carets" (see 
Figured]). A pair of carets in a given column are considered to be common carets, if they are 
both immediate neighbours of the gaps. Further, the extreme left and right columns of the 
diagram cannot be empty. Each element of F corresponds uniquely to such a forest diagram 
with no common carets. 

The labels of the gaps are determined by the structure of the forests. We found it 
convenient to make a slight modification of the original labelling rules; we label gaps by 
applying the following rules in order: 
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(i)- 


"L" 


if it is left of the pointer and exterior (outside a tree), 


(ii). 


"N" 


if it is immediately left of a caret and interior (inside a tree), 


iii). 


"X" 


if it is immediately left of a caret and exterior. 


iv). 


"R" 


if it is right of the pointer and exterior. 


(v). 




if it is interior. 



Belk & Brown give "N" and "X" the same label, however the above labelling means that 
"L", "R" and "X" are always exterior, while "I" and "N" are always interior. The length of 
a group element represented by a diagram u is then 

cScolumns 

where W{c) is the weight of a column as given in Table 1. We refer to this sum as the weight 
of a given forest diagram. 



o 


I 


N 


L 


R 


X 


I 


2 


4 


2 


1 


3 


N 


4 


4 


2 


3 


3 


L 


2 


2 


2 


1 


1 


R 


1 


3 


1 


2 


2 


X 


3 


3 


1 


2 


2 



Table 1: The weight of a column is given by the above simple function of the gap labels. 

Our table differs from that in Belk & Brown pj also in that our table entries include the 
contribution from the number of carets, whereas Belk & Brown add this contribution sepa- 
rately. 

It should be noted that labels are defined very locally; to label a given column we do 
not need to know the labels of columns far away. While this is not crucial for Algorithm A, 
it is critically important for Algorithm B as it allows us to build a diagram with its labels 
column-by-column. This will be discussed further below. 

2.4- Implementing Algorithm A for Thompson's group F 

Propositions 3.3.1 and 3.3.5 of [ij describe how multiplication by each of the generators 
and their inverses changes a forest diagram. Multiplying a reduced forest by Xq ^ simply 
moves the top pointer one tree to the right or left which can be done in constant time. 
Multiplying by Xi adds a single caret to the roots of two adjacent trees, then canceling 
common carets if they exist. Multiplying by either adds or deletes a caret in the bottom 
or top forests, and then cancels common carets if they exist. Note that after multiplying by 
xf^ at most a single common caret exists. 
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So a word of length n produces a forest of at most n carets, so we can store the forest 
diagram for such a word in space 0{n). We can turn this into an algorithm to compute the 
geodesic length of an element of F as follows: 



Algorithm GeodesicLengthF. Computes the geodesic length of an element in F 
Input a word of length n in the generators x^^,xf^. 

Start with the empty forest diagram (no carets, and both pointers at the position). 

For each letter of u, redraw the current forest diagram by multiplying by this letter. To do 
this one must scan through the stored diagram and possibly add or delete a caret; this takes 
0{n) operations. 

When all letters are read, compute the weight of the resulting diagram by labelling it (which 
takes 0{n) operations) then read the geodesic length off from the table (which takes 0{n) 
operations). 



Proposition 4. One can compute the geodesic length of a word in F in time 0{'n?) and 
space 0{n). 

Proof. We store the forest diagram as a pair of two trees, and use depth-first search to run 
through the trees. The size of the tree is 0{n) and it takes 0{n) steps to scan through and 
reduce a pair of common carets. So as we run through each letter of u we perform 0{n) 
steps, so all together 0{n?). The final step takes 0{n) to label the forest diagram and 0{n) 
to compute the weight. □ 

The above algorithm allows us to quickly compute the geodesic length of any element of F 
(and so d±)\ it formed the basis of our implementation of Algorithm A. This then allowed us 
to calculate both the growth function and geodesic growth function for Thompson's group F 
up to length 22. See Table 3. Since F has exponentially many geodesies (it has exponential 
growth) the time for Algorithm A is exponential, but space is 0{n). 



In [16[ Cuba proves that the growth rate of the number of elements in Thompson's 
group F is bounded below by = 2.61803 . . . , which also serves as a lower bound for the 
growth rate of geodesies. We can use our data to obtain upper bounds for both growth and 
geodesic growth. 

Since every element of length n + m is the concatenation of some element of length n and 
some element of length m, it follows that f{n + m) < f{n)f{m) and so the sequence f{n) 
is submultiplicative. The same is true for the sequence g{n), which counts the number of 
geodesies. Fekete's lemma j26| states that if a sequence a(n) is submultiplicative, then the 
sequence a(n)^/"' is monotonically decreasing, which means that for any fixed n, the number 
a(nY^^ is an upper bound on the limit of a{nY^^. From this we obtain the following: 
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12 


431238 


556932 
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4 


4 


13 


1180968 


1588836 


2 


12 


12 


14 


3225940 


4507524 


3 


36 


36 


15 


8773036 


12782560 


4 


108 


108 


16 


23809148 


36088224 


5 


314 


324 


17 


64388402 


101845032 


6 


906 


952 


18 


173829458 


286372148 


7 


2576 


2800 


19 


467950860 


804930196 


8 


7280 


8132 


20 


1257901236 


2255624360 


9 


20352 


23608 


21 


3373450744 


6318588308 


10 


56664 


67884 


22 


9035758992 


17654567968 


11 


156570 


195132 









Table 2: Enumeration of group elements and geodesies of Thompson's group F. 



Proposition 5. The growth rate 7 := lim„_j.oo for the growth function of Thomp- 

son's group F satisfies the bounds 

— = 2.61803... < 7 < 15(22)1^/22 ^ 2.8349398... (2) 

Similarly, the growth rate /i := lim„^oo |r„|^/" for the geodesic growth function of Thompson's 
group F satisfies the bounds 

^^y^ = 2.61803 ...< /i < |r22r/'' = 2.92257... (3) 

In the next section we will sharpen the upper bound for the growth of elements consid- 
erably. 



3. Algorithm B 

In this section we describe a second algorithm to compute the number of elements in the 
sphere of radius n in Thompson's group F with standard generating set; this algorithm runs 
in polynomial time and space. 

Recall the forest diagram definition from the previous section. We count forest diagrams 
by their length using a column-by-column construction. Such constructions have been used to 
great effect in combinatorics and statistical mechanics yielding closed form solutions (for ex- 
ample 25, 2!) , polynomial time algorithms (for example jsf) and exponential time algorithms 



that are exponentially faster than brute force methods (for example (llj). The construction 
we use here gives rise to a system of recurrences which is rather cumbersome. As such, we 
do not give these recurrences explicitly, but instead iterate them using Algorithm B. 

An incomplete forest diagram can be thought of as a forest diagram with all rightmost 
columns deleted after some point. In order to append a column in a "legal" way we only 
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need to keep track of the labels of the rightmost column and two numbers which measure 
how far the top and bottom trees are from being "complete" . 

3.1. Encoding and counting binary trees 

Since the forest diagrams consist of a pair of sequences of binary trees, the starting point 
for our enumeration algorithm is to find an efficient encoding of binary trees. Binary trees 
are counted by the Catalan numbers and there is a large number of encodings. We have 
chosen one that naturally reflects the column-by-column construction used. 

We start by labelling leaves and internal nodes of the trees by "N" , "I" , "n" and "i" as 
shown in Figure [31 By convention we will not label the left-most leaf. 

/ \ X X 

N I n i 

Figure 3: We label leaves of a binary tree by "N" and "I" as shown. Similarly we label the internal nodes 
by "n" and "i" . By convention we always label the root node "n" . 

Reading these labels from left to right gives a word in the alphabet {n, N, i, 1} which 
starts with an "n" and alternates lower and upper case letters. The tree in Figure H] is 
encoded by the word "nInNilnNnlil" . 




X nl riN il nN nl il 

Figure 4: The tree above is labeled as shown. This gives rise to the encoding nlnNiluNnlil. For the purposes 
of the proof of Lemma [6] we prepend an N to this word. In our main algorithm, we will label the first leaf 
by "L" or "X" , and delete all of the lower case labels. 

This labelling was used in Jl2| and they proved that each tree has a unique encoding. 
The following lemma is Theorem 4.4 in that paper with modified notation. 

Lemma 6. A word, w, encodes a non-empty tree if and only if 

(i) . it starts with "n", 

(ii) . ends with "I", 

(iii) . alternates lowercase and uppercase letters, 

(iv) . all of its prefixes u satisfy \u\n+ \u\j\j > \u\j^+ \u\j (where \u\y denotes the number of 

occurrences of "y" in u) 
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(v). \w\n+\w\j^= \w\j^+\w\j. 

Proof. For the purposes of the proof we prepend an initial "N" to all encoding words. Call a 
word on {n, N, i, 1} admissible if it has an initial "N" and it satisfies the 5 conditions above 
when the initial "N" is deleted. 

Consider the mapping ipk that associates a binary tree of k carets to the word of length 
2k + 1 obtained by starting with an "N" and then reading the labels from left to right (see 
Figure Hj). We show by induction on k that ipk is bijective onto admissible words of length 
2k + 1. 

For k = 1, this is evident, there is a unique tree with one caret, which is mapped by tpi 
to the unique admissible word of length 3, namely "Nnl". 

Assume that the bijection holds for k > 1 and we will show that it also holds for k + 1. 
First we check that the image w of a tree t with k + 1 carets is admissible. Let t' be a tree 
obtained from t by deleting two leaves adjacent to the same caret c (so c becomes a leaf in 
t') and let w' = ipkit')- As seen in Figure w is equal to w' where an upper-case letter 
"A" is replaced by the 3-letter factor "Nal" , where "a" is the lower case of "A" . Since w' is 
admissible (by induction), it follows that w is also admissible. 




(N) N I (N) '^JsT I) I 
(N) nN il (N) nN nl il 



Figure 5: The effect of expanding a leaf with label A e {N, 1} into a caret is to replace "A" by "Nal" in the 
encoding word, where "a" is the lower case of "A" . 

Now to prove that ipk+i is bijective consider an admissible word w of length 2k + 3 and 
we will show that it has a unique pre-image. Let W be the word obtained from w by keeping 
only its upper-case letters. Since W starts with "N" and ends with "I" it must contain a 
factor "NI" . This lifts to a 3-factor "Nal" in the word w. Replacing this 3-factor by a single 
letter "A", where "A" is the upper-case of "a", we obtain a word w' of length 2k -\- 1. It 
follows that w' is also admissable. 

Moreover one may verify (see Figure [5]) that any pre-image t of w must be obtained from 
a pre-image t' of w' where the leaf corresponding to "A" is replaced by a caret with two 
leaves below it. Since there is a unique pre-image of w' (by induction), there is also a unique 
pre-image of w. □ 

We can now count binary trees by counting admissable words in this alphabet. Define 
the size of an even-length word to be its half-length. Following from condition (iv) above 
we define the excess of a word, w, to be the quantity |(|w|Ar-|- — 1^1/ — \w\i). So 
appending "nN" increases the excess by 1, appending "nl" or "iN" does not change the 
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excess and appending "il" decreases the excess by 1. Further, appending any of these pairs 
increases the size by 1.. 

Corollary 7. A word encodes a nonempty binary tree if and only if its excess is zero. 

Proof. This follows from condition (v) above. □ 

Given an even length word u and its excess we can decide which pairs can be legally 
added to the word — in fact there is little restriction other than when the excess is 0, in 
which case one may not append iN or il since this violates condition (iv). Thus one can 
construct a recurrence satisfied by c^^h, the number of valid words of length 2i (size i) and 
excess h. 

• There is a single valid code word of size 1, namely "nl". Hence ci,o = 1- 

• Any word with excess equal zero can be extended by 

— appending "nN" which makes its excess 1, or 

— appending "nl" which leaves its excess 0. 

• Any word with excess strictly greater than zero can be extended by 

— appending "nN" which increases its excess by 1, or 

— appending "nl" which leaves its excess unchanged, or 

— appending "iN" which leaves its excess unchanged, or 

— appending "il" which decreases its excess by 1. 

This translates into the following recurrence. 

ci,o = 1 

Jq_i,o + Q_i,i iih = 

Cih= \ 

+ 2Q_i,^ + Q_i,fe_i Otherwise 

While this recurrence can be solved explicitly, we show how it may solved iteratively in 
order to help explain our main result. Notice that the algorithm that we use to iterate this 

recurrence does not "look backward" and compute c^^h in terms of C£_i^;j and Q-i,/i±i, rather 
it "looks forward" and determines which coefficients Q+i^/^/ gain contributions from q^/j. Of 
course this algorithm (and our main result) could be rewritten to be "backward looking" 
but the authors feel that this way follows more naturally from the construction. 



Algorithm CountBinaryTrees. Outputs the number of binary trees of a desired size. 

Input maximum size (half-length) N and set size £ = 0. 

Set words (1, 0) = 1 — words (£, h) has to store the number of words of size £ and excess h. 

12 



(4) 
(5) 



At the current size £ run through all possible values of the excess h. 

If the current value of h is zero then (because we can append "nN" and "nl" ) 

• increment words(£+ 1, 1) by words(£, 0), and 

• increment words +1,0) by words (£, 0). 
Otherwise (we can append "nN" , "nl" , "iN" and "il" ) 

• increment words (£ + l,h+ 1) by words (£, h), and 

• increment words (^+ l,h) by 2words(£, /i), and 

• increment words (£ + l,h — 1) by words (£, h). 

After running through all possible values of the excess, increment the size € by 1 and output 
words (£, 0) — the number of valid complete words of the current size. 

Keep iterating until the desired size is reached. 



Notice that we must iterate through all values of the size and excess in order to compute 
the final enumeration. Since the excess is at most the size for each (possibly incomplete) 
forest diagram, this means that the algorithm requires 0{n^) time. Once we reach a given 
size we can forget all the data from shorter lengths. In fact we only need to remember the 
current size and the next size. This means the algorithm requires 0{n) space. 

For the main algorithm below, the labels "I" and "N" correspond precisely to the labels 
used in the Belk & Brown geodesic length formula. The lower case labels can then be ignored 
and we instead keep track of the excess and note that when appending an "N" the excess 
either stays constant or increases by 1, and when appending an "I" the excess either stays 
constant or decreases by 1. Note also that the size (which was the half-length when lower- 
case letters are considered) coincides with the length when the lower-case letters are ignored, 
so we will use the terminology of length (instead of size) from now on. 

3.2. Transitions in the upper forest 

We now describe how to construct the upper half of the forest diagram column-by-column; 
the lower half is encoded by an identical procedure. We commence every forest with an empty 
column labelled "L". The possible transitions from a column with a given label to the next 
depend only on that label, the excess and whether one is to the left or right of the pointer. 
Note that the excess can only be non-zero if the label is either "I" or "N" . We now describe 
this transitions in detail. 

We will write the last column of the current upper forest as a triple of the label, a flag 
indicating whether it is left or right of the pointer, and the excess. For example, a diagram 
ending in an "L" must be left of the pointer, and have excess and so we denote it by 
(L, left, 0). Call this triple the state of the upper diagram. 
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A ^ /. ' /k 
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Figure 6: Consider a diagram ending in "L" . We can append new columns to the end of this diagram as 
shown above. In particular, an "L" may be followed by an "L" or "R" . If we append an "X" then we must 
commence a tree at the next column (indicated by the shaded region). Similarly we may start a tree in 
exactly 4 different ways; either with an "I" or an "N" and with the tree pointed or not. 

• If the upper forest ends in an "L" then it must be in state (L, left,0). See Figure [61 
We may append 

— "L", which keeps the diagram in state (L, left, 0), 

— "R", which means the pointer is now to the left of the current position, so the 
diagram is now in state (R, right, 0), 

— "X" , so again the pointer has been passed, and the diagram is in state (X, right, 0) , 

— "I", which starts a tree with codeword "nl" so the state becomes (I, left, 0), 

— "I" and put the pointer on the tree (just started by this "I") and so move to the 
state (I, right, 0), 

— "N", which starts a tree with codeword "nN" so the state becomes (N, left, 1), 

— "N" and put the pointer on the tree (just started by this "N") and so move to 
the state (N, right, 1). 

/ /% 

R R R X XI X N 



Figure 7: If a diagram ends in "R" then we can either append another "R" or an "X" . On the other hand, 
if the diagram ends in "X" then we must start a tree with either "I" or "N" . 

• If the upper forest ends in an "R" then it must be in state (R, right, 0). See Figure [3 
We may append 

— "R" which keeps the diagram in state (R, right, 0), 

— "X" which adds a gap immediately to the left of a new tree, and moves to 
(X, right, 0). 
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• If the upper forest ends in an "X" then it is in state (X, right, 0) and we must start a 
new tree. See Figure El We may append 

— "N" and move to the state (N, right, 1), 

— "I" and move to the state (I, right, 0). 

r.\\ / 

II IN 

l.\ . ^\ . /I 

I L I R IX 

Figure 8: If a diagram ends in "I" of excess then we can continue the tree with an "I" or an "N" as 
described in Section [Ql (note that an internal node labelled by "n" will be created). If we are to the left of 
the pointer then we can append an empty column labelled "L" . If we are to the right of the pointer then we 
may append an empty column labelled "R" or "X" . 

• See Figure [H If the diagram is in state (I, left, 0) then we may 

— continue the current tree by appending "N" or "I" and so move to (N, left, 1) or 
(I, left, 1), 

— finish the tree by appending "L" arriving in state (L, left, 0). 

• Similarly, if the diagram is in state (l, right, 0) then we may 

— continue the current tree by appending "N" or "I" and so move to (N, right, 1) 
or (I, right, 0), 

— finish the tree by appending "R" or "X" and so arrive in state (R, right, O) or 
(X, right, 0). 

• Finally, if the diagram is in state (N, p,h), (I,p, h) with h > and p = left or right 
then we are in the middle of constructing a tree (as described in Section [3.ip . Note 
that one cannot have label "N" with excess zero, nor may we change p. We may now 

— add "N" and increase the excess by 1, moving to (N, p,h+ 1), 

— add "N" leaving the excess unchanged giving (N, p,h), 

— add "I" leaving the excess unchanged giving (I,p,h) 

— add "I", decreasing the excess by 1, moving to (l,p,h — 1) 
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Again, these constructions are easily turned into an algorithm and we refer the reader to 
the pseudocode for UpdateLeft (state) and UpdateRight (state) in Appendix IA.2I 

One might also attempt to write out a set of recurrences, but they are quite cumbersome; 
we now require several sets of coefficients since one must keep track of the excess, the 
final label and whether one is to the left or right of the pointer. This becomes extremely 
unwieldy for the full algorithm when one keeps track of the states of the upper and lower 
forests. Because of this we were unable to write out the full recurrences; instead we keep the 
recurrences implicit and iterate them using our algorithms. 

3. 3. The full algorithm 

The full algorithm is very similar to Algorithm CountBinaryTrees, except that now 
the states of the upper and lower forests must be remembered and all possible pairs of 
transitions must be determined. The set of pairs of transitions is very nearly the Cartesian 
product of the set of transitions for the upper forest and the set of transitions for the lower 
forest; the only restriction is that we must prevent the creation of common carets. 

A caret is created in the upper (lower) forest immediately above (below) a gap when a 
transition is made from a gap labelled "I" , "N" , or "X" to a gap labelled "I" (note that it is 
not possible to have a transition from "R" to "I"). Thus a pair of common carets is created 
when a transition is made from a pair of gaps labelled from a pair of gaps labelled by 

We iterate through the recurrence starting from a diagram that consists of a single empty 
column with gaps labelled by [^]. At each iteration we examine all the possible states and 
consider all possible transitions from those states (avoiding common-carets) to new states 
using the rules described above. The contribution of diagrams of the current length is then 
added to the appropriate total for diagrams with new states with weight updated according 
to the new labels and Table 1. A forest diagram is complete when it finishes with an empty 
column labeled by [^]. This should be seen to be very similar to Algorithm CountBina- 
ryTrees except that the transitions are more complicated. 

This enumerates all forest diagrams starting with an empty column labelled by and 
ending with another empty column labelled by [^]. This, unfortunately, is not the precise 
set of diagrams that Belk & Brown considered. They considered diagrams without any empty 
columns at either end; Proposition [H] shows how to correct the output of the algorithm. 



Algorithm B. Counts forest diagrams starting with , and ending with 



Input maximum length and set length ^ = 2. 

Set totals(2, (L, left, 0), (L, left, 0)) = 1 — this counter stores the number of diagrams of 
a given weight ending in a column with given labels and excess. 

At the current length ^ run through all the states of the upper and lower forests, a, r, for 
which totals (£, 0", r) ^ 0. 
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Let c = totals (£, a, r). This is the number of diagrams of length I that end in a column in 
state 0", r. 

Given the current state of the upper forest compute the new states that may be reached 
according to the rules in Section [3l2l Do similarly for the lower forest. 

For each pair of transitions (a', r') that does not create a common caret, compute the new 
length i' using Table 1, and increment totals (£', a', r') by c. 

When all the states of the current length have been examined, output the number of complete 
diagrams, namely totals(n, (R, right, 0), (R, right, 0))). 

Keep iterating until the desired length is reached. 



We give a more detailed account of this algorithm in pseudocode for CountForestDiagrams () 
in Appendix IA.2I This main routine relies on the routines UpdateLef t () and UpdateRight () 
to compute the possible transitions. The cost analysis of the algorithm is given below in 
Proposition [9l 

As noted above the forest diagrams enumerated by this algorithm begin with any positive 
number of empty columns labelled "[^]" and then end in any positive number of empty 
columns labelled "[^]", whereas those of Belk & Brown have no such empty columns. 
Thankfully we can easily obtain the correct count as follows. 

Let 

F{q) = h + hq + f2q' + ... 

be the formal power series with coefficients /„ = the number of forests of weight n with no 
leading "[^]" or tailing "[^]" columns. Let 

H{q) = h^q'^ + h^q^ + h^q^ + ... 

be the formal power series with coefficients hn = the number of forests of weight n which 
have a positive number (at least 1) leading "[^]" and a positive number of tailing "[^]" 
columns (which are counted by our algorithm). Note that ho = hi = h2 = hs = since such 
forests must have weight at least 4. Then 

Proposition 8. 

Fil)=(^jH{q) 

Proof. Let G{q) be the formal power series with coefficients Qn = the number of forests of 
weight n with no tailing "[^]" columns, but one or more leading "[^]" columns. Note that 
9o = 91 = since such forests have weight at least 2. Then gn = fn-2 + fn-4 + • • • since an 
empty column labelled "[^]" increases the length of a forest diagram by 2. That is, we have 
fn-2 with just one leading "[^]" , fn-4 with two leading "[^]" , and so on. 
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Thus 



foq'+fiq'+f2q'+f3q'+hq'+. • • 
+foq'+fiq'+f2q'+- • • 
+foq'+. . . 



{q' + q' + q' + ...)F{q) = - 



A similar argument shows that 



H{q) = G{q) 




from which the result easily follows. 



□ 



3.4- Cost analysis 

Proposition 9. Algorithm B computes the number of elements in the spheres of radius up 
to n in time 0{n^) and space 0{n^). 

Proof. The final generating function transformation takes constant time and space, so our 
main concern is the complexity of enumerating a diagram with at most n columns. Since 
each column has weight at least one, each element of the ball of radius n will be represented 
by such a diagram. 

When a new column is appended to a diagram the weight changes by at least 1 and at 
most 4. Thus incomplete diagrams of weight n will only contribute to incomplete diagrams 
of weights between n + 1 and n + 4. Once we have scanned through all diagrams of weight n 
and computed all possible transitions we will not need them again and they can be thrown 
away. In this way we only need to store information about diagrams of five different weights 
at any given time. Thus the algorithm needs to store O(n^) coefficients. During the n^^ 
iteration of the algorithm we scan through each of the O(n^) diagrams of weight n and 
compute the possible transitions; this takes O(n^) operations. Since a diagram of weight n 
may contain at most n columns the algorithm requires 0{n?) operations. □ 

3.5. Growth rate 

Using this algorithm we have been able to compute f{n) for < n < 1500 using modest 
computer resources. We give some of the resulting data in Table 2; the full sequence has 
been published as sequence A156945 in the Online Encyclopedia of Integer Sequences [23| . 
We coded the algorithm in gnu-c++ using the Standard Template Library map container to 
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n 


f{n) 


J(^)l/n 


f{n)/f{n-l) 





1 


O 


o 


1 


4 


4 


4 


2 


12 


3.464. . . 


3 


3 


36 


3.301... 


3 


4 


108 


3.223... 


3 


5 


314 


3.157... 


2.907. . . 


6 


906 


3.110... 


2.885... 


7 


2576 


3.070. . . 


2.843. . . 


8 


7280 


3.039. . . 


2.826... 


9 


20352 


3.011... 


2.795... 


10 


56664 


2.987... 


2.784. . . 


20 


1257901236 


2.850... 


2.688... 


50 


6015840076078706884412 


2.726. . . 


2.627... 


100 


5023 ■ ■ ■ 5868 


2.673. . . 


2.6184... 




43 digits 






200 


pl58 ■ ■ ■ 3328, 


2.645. . . 


2.618034. . . 




85 digits 






500 


7798 ■ ■ ■ 8648 

V ' 


2.628. . . 


2.6180339887498949. . . 




210 digits 






1000 


7579 • ■ ■ 7676 

" V ' 


2.623. . . 


differs from ^^^^ by about 10~^^ 




419 digits 






1500 


7367 ■ ■ ■ 9566, 


2.62167... 


differs from by about 10""^^ 




628 digits 







Table 3: Table of the number of elements of geodesic length n. By Fekete's Lemma, /(n)^/" is an upper 
bound on the growth rate. We note that the ratio of successive terms appears to converge rapidly to Cuba's 
conjectured value for the growth rate; unfortunately we have not been able to prove that it converges. 



store the coefficients indexed by their label^ and we used the CLN number hbrarjjf] to handle 
large integer arithmeticQ. 

As noted in the previous section, the sequence f{n) is submultiplicative and so Fekete's 



lemma [26l implies that the growth rate, lim„_>.oo /(/i)^^"', exists and is bounded above by 
f^nfln for any n. Our data then gives a sequence of rigorous upper bounds on the growth 
rate and combining with Cuba's lower bound [16| gives the following result. 



^We also implemented the algorithm using hashmap; this used roughly the same amount of computer 
memory. Despite being theoretically faster than map we found that it slowed the algorithm in practice. 
■^Available from http://www.ginac.de/CLN/ at time of writing. 

*An alternative approach would have been to use modular arithmetic and reconstruct the results using 
the Chinese remainder theorem. Since f{n) grows exponentially with n, this approach would require us to 
run the algorithm roughly 0{n) times using different primes. This would appear to be a way of adapting 
this algorithm to run on a cluster of computers. 
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Proposition 10. The growth rate 7 := lim„_^oo /(^)"^''" of Thompson's group satisfies 

= 2.61803 . . . < 7 < /(1500)^/^^°° = 2.62167 . . . 



(6) 



We have plotted the sequence of upper bounds, /(n)^/", together with the lower bound 
in Figure [9] (left). It is not unreasonable to extrapolate these upper bounds and reach the 
following conjecture: 

Conjecture 11. The growth rate of Thompson's group F with standard generating set 

IS 




0.01 
^-1 



Figure 9: (left) A plot of f(nY/^ versus n for 50 < n < 1500. For comparison we have also plotted 
Cuba's lower bound of '^'^^ ■ (right) A plot of the ratios f{n)/f{n + 1) for 50 < n < 1500, and Cuba's lower 
bound. For large n the ratios are indistinguishable from Cuba's lower bound. 

To further support this conjecture we have examined the sequence (/(2n)//(n))^/" which 
must converge to the growth rate and we observe that it converges extremely rapidly to 
Cuba's lower bound. Additionally, in Figure [H] (right) we have plotted f{n)/f{n— 1). The 
limit of this ratio, if it exists, must be the growth rate. While we have not been able to prove 
convergence it does appear to converge rapidly and is nearly indistinguishable from Cuba's 
bound. Note that Cuba computed this ratio for n = 9 in [ig], to obtain a conjectured upper 
bound of 2.7956043 . . .. 



Numerical analysis of the series [10[ indicates that the corresponding generating function 
F{z) = ^ f{n)z^ has an isolated simple pole at implying that 

= A + exponentially smaller terms (7) 

with A 8.02374.... The first correction term appears to be approximately 0(2.432"). 
We estimated this term by studying the asymptotics of the generating function (1 — 3^; + 
z'^)F{z)\ this polynomial factor cancels the dominant contribution to the asymptotics from 
the (observed) simple pole at 
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4. Outlook 



In this paper we have described two algorithms for computing the size of the sphere 
of radius n of Thompson's group with standard generating set. The first of these runs in 
exponential time and polynomial space, is potentially generalisable to other groups, and also 
computes the geodesic growth series. The second runs in polynomial time and space and we 
have used it to compute the first 1500 terms of the growth series. We then used Fekete's 
lemma to compute upper bounds for the growth rate from this data. This work suggests 
that the growth rate of F is exactly equal to '^^^ ■ This strongly indicates that the normal 
forms described in 15|, ll6|] have very nearly geodesic length. More precisely, we believe that 
the word length of the normal form of a typical element differs from its geodesic length only 
by 0(1). 

We analysed the sequence f{n) using our data and we have found some indication (us- 
ing series analysis techniques such as differential approximants 17|) that the corresponding 
generating function contains square-root singularities and so is unlikely to be rational. Fur- 
ther, despite having the first fifteen hundred terms of the sequence, we have been unable 
to conjecture a rational, algebraic or differentiably fini te g enerating function. In particular 



we used Guess package developed by Manuel Kauers to search through a wide range 
of possible recurrenceC|. Unfortunately this search did not find any candidates. While this 
does not prove that the generating function lies outside these classes, it does rule out the 
possibility that the generating function is simple (satisfying a recurrence of low degree or 
order) . 

It is relatively straightforward to extend Algorithm B to generate elements of a fixed 
geodesic length uniformly at random in polynomial time. Algorithm A was derived from an 
approximate enumeration algorithm 2l| which can also be used to sample large elements of 
the group. We are currently investigating how these random generation techniques might 
be used (in the same spirit as |5|]) to explore properties of the group that are still beyond 
analytic techniques such as its amenability. 
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A. Appendix: Pseudocode 

Throughout the appending we will use the following notations: 

• X i— y: set the variable x to value y, 

• X = y: the boolean operation that returns "true" if x and y are the same and otherwise 
returns "false", and 

• x+ = y: increment the variable x by y. 

We use X ^ y to distinguish the assignment of a value to a variable from the test for equality. 
While we could use x x + y instead of x+ = y we feel that the later easier to read when 
using long and descriptive variable names. 

A.l. Algorithm A 

The function ComputeGeod recursively outputs all geodesic words of length A^. Of course 
this function has the drawback that it generates a list of the geodesies that must be stored 
either in memory or on disk before it can be processed to give the number of elements. 
The following function avoids that problem. Calling ComputeSphere(e, A^), where e is the 
empty word, will return the size of the sphere of radius A^. It recursively computes the 
geodesies of length A^ starting with prefix w and instead of storing them in a list, it returns 
the sum of their contributions. Alternatively a non-recursive procedure NextGeodesicO is 
given below. This computes the first geodesic after w of length at most A^. 
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ComputeGeodCw ,iV) — output all geodesies of length N with prefix w. 
Input: Geodesic word w, Maximum length N 
if \w\ = N then Output: w 
else 

|_ foreach x e d+{w) do ComputeGeod {wx, N) 



ComputeSphereCwjA/^) — Find the size of S{n) 
Input: Geodesic word w, Maximum length N 

s ^0; 

if \w\ = N then 

|_ S ^ 11^=1 \d-{Wi)\ 

else 

|_ foreach x e d+{w) do s+ =ComputeGeod {wx, N) 
Output: s 



NextGeodesicCw;, A^) — find the first geodesic after w of length at most N 
Input: Geodesic word w, Maximum length 
// For this algorithm let d^{w) = when \w\ = N . 
if d+{w) 7^ then 

X ■<— first generator in d+{w) 
Output: wx 

while d^{w) = do 
X last letter of w 
Delete last letter of w. 
a X ^ last generator in d+{w) then 

y ^ first generator after x in d+{w). 
|_ Output: wy 

else if \w\ = then 
|_ Output: "No more geodesies" 
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A. 2. Algorithm B 

We give the pseudocode for our main algorithm divided into three different functions. 
Given the current state of the upper (or lower) forest diagram that is left of the pointer, 
the function UpdateLef t () returns the possible states of the diagrams produced by a valid 
transition (as described in Section [X^ above) . Similarly, given the current state of the upper 
(or lower) forest diagram that is right of the pointer, the function UpdateRight () returns 
the possible states of the diagrams produced by a valid transition. 

Finally, CountForestDiagrams () enumerates all forest diagrams according to their weight 
(the geodesic length of the group elements they represent). It starts from the empty diagram 
with gaps labelled by . At each iteration it runs through all the pairs of upper and lower 
states that have been reached and determines which pairs of states can be reached by valid 
transitions using UpdateLef t () and UpdateRight () . The transitions of the upper and lower 
forests are nearly independent of each other; the only restriction is that we avoid creating 
common carets and they are easily avoided. The weight of the diagram can then be updated 
using the information in Table 1 and the appropriate counters can be updated. At the end 
of each iteration we output the number of completed diagrams of the current weight (which 
are those ending in gaps labelled [^]. 



UpdateLef t (state) — Return the set of states that can be reached from the current 
state when left of pointer 

// State is left of pointer by assumption 
Input: State of half-column (label, left, h) 

// NewStates will be the set of states reached from the current state 
NewStates 0. 
if label = L then 

Add (L, left,0) to NewStates ; // another gap 

Add (N, left, 1), (1, left, 0) to NewStates ; // start a tree 

Add (N, right, 1), (I, right, 0) to NewStates ; // pointer & start tree 

Add (R, right, 0) to NewStates ; // pointer & gap 

Add (X, right, 0) to NewStates ; // pointer & gap and start new tree next 

if label = N or flabel = 1 and h > 0^ then 

Add (N, left, h + 1), (N, left, h), (l, left, h) and (I, left, h - 1) to NewStates ; 
// Continue current tree 

if label = I and h = then 

Add (N, lef t, 1) and (I, left, 0) to NewStates ; // continue tree 

Add (L, lef t, 0) to NewStates ; // finish current tree 

/* Note that one cannot finish a tree and immediately have the 
pointer, so the current state cannot be followed by a state 
labelled ' 'R' ' or ' 'X' ' */ 

Output: NewStates 
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UpdateRight (state) — Return the set of states that can be reached from the current 
state when right of pointer 

// State is right of pointer by assumption 
Input: State of half-column (label, right, h) 

// NewStates will be the set of states reached from the current state 

NcwStatcs ^ 0. 
if label = R then 

Add (R, right, 0) to NewStates ; // another gap 

Add (X, right, O) to NewStates ; // another gap, start new tree next 

if label = X then 

|_ Add (N, right, l) and (I, right, O) to NewStates ; // start a tree 

if label = N or (label = I and h> 0) then 

Add (N, right, h+ 1), (N, right, h), (I, right, h) and (I, right, h - 1) to 
NewStates ; // Continue current tree 

if label = I and h = then 

Add (N, right, 1) and (l, right, 0) to NewStates ; // continue tree 

Add (R, right, 0) to NewStates ; // finish current tree 

Add (X, right, 0) to NewStates ; // finish current tree, start new tree 
next 

Output: NewStates 
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CountForestDiagrams (M) — Count forest diagrams of weight at most M. 
Input: Maximum length M 

/* totals(n, o", r) stores the number of diagrams of weight n, with upper 
diagram in state a and lower diagram in state r. Initially all are 
zero except the following */ 
totals(2, (L, left, 0), (L, left, 0)) 1 
for n ^ 2 to M - 1 do 

foreach (c, r) with totals (n, cr, r) 7^ do 

// The following produces sets of new upper and lower forests 
if a is left of upper pointer then 
|_ upper-set •<— UpdateLeft (cr) 

else 

|_ upper-set UpdateRight (a) 

if r is left of lower pointer then 
|_ lower-set <(— UpdateLeft (r) 

else 

|_ lower-set ■<— UpdateRightCr) 

// Construct new diagrams in states a',T' from all possible pairs 

of transitions 
foreach {a', t') G upper-set x lower-set do 

//We must check new columns for common carets, 
if (a' .label — t' .label — l) and (a. label 1) and (t. label l) then 
|_ // reject new state as it produced common caret 

else 

// no common caret so keep new state with updated weight 
// Weight computes the change in weight using Table 1 
_ totals (n-|-Weight (a'. /a&e/, r'. label), a',T')-\- = totals(n, cr, r) 

|_ Output: (n-M,totals(n-M, (R, right, 0), (R, right, 0))) 
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